Development and validation of a nomogram for predicting Mycoplasma pneumoniae pneumonia in adults

The study aimed to explore predictors of Mycoplasma pneumoniae pneumonia (MPP) in adults and develop a nomogram predictive model in order to identify high-risk patients early. We retrospectively analysed the clinical data of a total of 337 adult patients with community-acquired pneumonia (CAP) and divided them into MPP and non-MPP groups according to whether they were infected with MP. Univariate and multivariate logistic regression analyses were used to screen independent predictors of MPP in adults and to developed a nomogram model. Receiver operating characteristic (ROC) curve, calibration curve, concordance index (C-index), and decision curve analysis (DCA) were used for the validation of the evaluation model. Finally, the nomogram was further evaluated by internal verification. Age, body temperature, dry cough, dizziness, CRP and tree-in-bud sign were independent predictors of MPP in adults (P < 0.05). The nomogram showed high accuracy with C-index of 0.836 and well-fitted calibration curves in both the training and validation sets. The area under the receiver operating curve (AUROC) was 0.829 (95% CI 0.774–0.883) for the training set and 0.847 (95% CI 0.768–0.925) for the validation set. This nomogram prediction model can accurately predict the risk of MPP occurrence in adults, which helps clinicians identify high-risk patients at an early stage and make drug selection and clinical decisions.


Materials and methods
Study population. We retrospectively collected clinical data from all adult patients diagnosed with CAP at Shanxi Bethune Hospital from January 2021 to December 2021. A total of 337 patients were finally included in the study and were divided into MPP group and non-MPP group according to whether they were infected with MP. The inclusion criteria were as follows: (1) age ≥ 18 years old and (2) fulfilled the diagnostic criteria for CAP in adults. CAP was defined as the presence of a new radiologic pulmonary infiltrate and the onset of symptoms with at least one of the following indicators: cough, sputum production, and dyspnoea; body temperature > 38.0℃; rales on auscultation; and a peripheral white blood cell count (WBC) > 10 × 10 9 /L or < 4 × 10 9 / L 17 . (3) The patient had positive results for MP. A definitive diagnosis of MPP is defined as a serological MPimmunoglobulin M (IgM) titre ≥ 1:160 or a fourfold increase in antibody titres during the acute and convalescent phases 18 . (4) The patient had complete clinical data. The exclusion criteria were as follows: (1) age < 18 years old; (2) infection with other pathogens; (3) pulmonary tuberculosis, bronchiectasis, bronchial asthma, chronic obstructive pulmonary disease and other pulmonary diseases; (4) severe community-acquired pneumonia; and (5) incomplete clinical data.
Estimates of the sample size were based on the principle of at least 10 outcome events per variable 19,20 , and our sample size was sufficient to yield valid results.
Data collection. Demographic, clinical, laboratory, and radiographic characteristics were collected within 24 h of admission. The demographic data included sex, age, and season of onset. The clinical data included fever, duration of fever (time from fever onset to hospitalization), body temperature, cough and sputum, dry cough, pharyngeal malaise, shortness of breath, aversion to cold, chills, fatigue, dizziness, headache, muscle soreness, and duration of hospital stay. The laboratory data included white blood cell count (WBC), absolute neutrophil count, lymphocyte count, platelet count (PLT), C-reactive protein (CRP), procalcitonin (PCT), erythrocyte sedimentation rate (ESR), and D-dimer. The radiographic characteristics were analysed independently by two experienced radiologists.
Ethics statement. This study was approved by the Ethics Committee of Shanxi Bethune Hospital. Written informed consent was obtained from a parent and/or legal guardian of each participant. This study was performed in accordance with the Declaration of Helsinki.
Statistical analysis. SPSS software version 22.0 (SPSS Inc.) was used to analyse the data. The median and interquartile range (IQR) were used for the quantitative data with a nonnormal distribution. The Mann-Whitney U test was used to compare the two groups. Percentage (%) and cases (n) were used for the enumeration data. The chi-square test (χ 2 test) was adopted for comparisons between the two groups. Binary logistic regression analysis was used to perform a multivariate analysis to obtain independent predictors of MPP in adults. The nomogram prediction model was constructed using the screened independent predictors. The nomogram was constructed and drawn using R software version 4.1.2 (https:// www.r-proje ct. org/). Multicollinearity was checked before determining the final model. The nomogram is based on the regression coefficients in the multiple logistic regression, with values ranging from 0 to 100 points, and is a visualization of the regression equation. Each parameter in the nomogram has a corresponding score, and the scores for each parameter are added to obtain the total score. Each total score corresponds to the probability of a clinical event occurring in a given patient 21,22 .
The discriminative ability and prediction accuracy of the nomogram were evaluated by the concordance index (C-index). A C-index is typically between 0.5, 0.5-0.7, 0.7-0.9 and > 0.9, which represents low, medium, high, and very high accuracy. The calibration curve was used to evaluate the actual and predicted risk of the MPP nomogram. The predictive power of the nomogram was assessed by the receiver operating characteristic (ROC) curve, and the area under the ROC curve (AUROC) was calculated. The clinical net benefit was assessed by the decision curve analysis (DCA) curve. Finally, bootstraps with 1000 resamples were used for internal validation. P < 0.05 was considered statistically significant.

Results
Patient characteristics. In this study, a total of 337 patients met the inclusion and exclusion criteria, including 112 patients in the MPP group and 225 patients in the non-MPP group. The patients were randomly divided into a training set (n = 236) and a validation set (n = 101) at a ratio of 7:3 23 Table 1. In the MPP group, the incidence was higher in the females than in the males, and the median age of patients was 33 years. In the non-MPP group, the incidence was higher in the males than in the females, and the median age of patients was 50 years. MPP was more likely to be present in the autumn and winter. There were statistically significant differences in sex, age, and season of onset between the two groups of patients (P < 0.05). www.nature.com/scientificreports/ In terms of the clinical manifestations, there were statistically significant differences in fever, duration of fever, body temperature, dry cough, pharyngeal malaise, shortness of breath, aversion to cold, chills, fatigue, dizziness, and duration of hospital stay between the two groups of patients (P < 0.05). In the laboratory results, the levels of CRP and D-dimer in the MPP group were lower than those in the non-MPP group, both of which were significantly different (P < 0.05). According to imaging examinations, the MPP group had a higher proportion of patients with tree-in-bud signs and a lower proportion of patients with pleural effusion than the non-MPP group, all of which were significantly different (P < 0.05).   The establishment and validation of the nomogram. According to the results of the multivariate analysis, we constructed a nomogram model including six predictors: age, temperature, dry cough, dizziness, CRP, and tree-in-bud sign (Fig. 1). The final model was validated internally by using the bootstrap method (1000 repetitions). The model had good precision and discrimination with a concordance index (C-index) of 0.837. As shown by the calibration curves, the calibration curves of the nomogram were highly consistent with the standard curves in the training  www.nature.com/scientificreports/ and validation sets (Fig. 2). The AUROC was 0.829 (95% CI 0.774-0.883) in the training set and 0.847 (95% CI 0.768-0.925) in the validation set (Fig. 3), indicating the high reliability of the nomogram's prediction ability.
Clinical utility of the nomogram was evaluated by DCA curves. The DCA curves of the nomogram were shown in Fig. 4. The net benefit of using the nomogram to predict MPP in adults was high when the threshold probability was between 0.02 and 0.71 in the training set (Fig. 4a) or between 0.01 and 0.83 in the validation set (Fig. 4b). Therefore, the nomogram had good clinical utility for predicting MPP in adults.  www.nature.com/scientificreports/

Discussion
MPP is a seasonal epidemic. A delayed diagnosis will increase the risk of infection in the surrounding people, and patients with severe MP infection may even need to be admitted to the intensive care unit (ICU), which affects the quality of life of patients. Therefore, it is critical to develop models for the early prediction of MPP in adults. In this study, which was a case-control study cohort of 337 patients, a nomogram model was developed to predict the risk of developing MPP in adults. The results showed that age, body temperature, dry cough, dizziness, CRP and tree-in-bud sign were independent predictors of MPP in adults. The nomogram based on these 6 factors showed good predictive performance.
Age was one of the independent predictors of MPP in adults in our study. In a nomogram study of RMPP in children, age was included, and the incidence of RMPP was positively associated with age 24 . However, there was no such association in adult MPP. MPP in adults is more likely to occur in young adults. The median age of the patients in this study was 33 years, similar to previous reports 25,26 . This may be related to the fact that young people have many social activities, and workplaces are mostly indoors with poor air circulation and are associated with people in close contact. At the same time, we found that women are more prone to MPP than men, and the specific reasons need to be further studied.
Fever, dry cough and dizziness are common clinical symptoms of MPP. Patients with MPP usually have different degrees of fever, and most of them are moderate to high 27 . We found that the average body temperature of the patients was 39 °C, which was consistent with previous studies 25 . Insufficient blood supply to the brain during fever in MPP patients may lead to dizziness. Dry cough is secondary to tracheobronchitis caused by the invasion of MP into the respiratory epithelium 28 . A study has reported that facial oedema, chest pain, chest tightness, and dizziness occur when patients with MPP cough violently 29 . In addition, dizziness in MPP patients may also be related to autoimmunity or the formation of immune complexes 8,30 . In our study, the combination of these clinical symptoms in predictive models offers a great potential advantage for predicting the occurrence of MPP in adults.
CRP is an acute-phase reactant that begins to be secreted 4-10 h after an inflammatory injury, peaks at 48 h, and has a half-life of 19 h. The magnitude of its increase was positively correlated with the inflammation severity. Dynamic monitoring of CRP levels can be used to assess the prognosis of hospitalized CAP patients 31 . In a prior investigation, CRP was found to be an independent predictor of refractory mycoplasma pneumonia in children 32 . We found that CRP was a predictor of MPP in adults and was mildly elevated in MPP adult patients, which suggested a mild inflammatory response in patients.
In this study, tree-in-bud signs and bronchial wall thickening were more common in the adult MPP group than in the non-MPP group, which is consistent with previous reports 26,33 . MP adheres to the respiratory mucosal epithelium via an adhesion protein, and it releases toxins that directly damage the respiratory epithelium and cause thickening of the bronchial walls. When lesions are distributed around the bronchioles, mucus and other inflammatory substances block the terminal bronchioles and alveolar sacs, resulting in the tree-in-bud sign 34,35 .
A nomogram is a visual representation of a statistical model that allows for personalized prediction of the incidence of clinical events. In previous studies, nomograms have been used to predict MPP in children 36,37 . However, there are few studies on nomograms of MPP in adults. To our knowledge, this is the first nomogram www.nature.com/scientificreports/ developed and validated that can be used to predict the risk of incidence of MPP in adults. The six predictors included in this nomogram were derived from routinely collected clinical data, which are available within hours of admission, and this can help clinicians to more quickly identify adults at high risk for MPP. However, there were some limitations. First, this single-centre retrospective study suffers from an inherent selection bias. Second, we only applied internal validation to evaluate the model, and large-scale, multicentre, prospective studies are needed for external validation before it can be applied to clinical practice. Third, this study is based on the cohort findings of the Chinese population and may not be applicable to patients of other ethnicities. We encourage validation of this model in centres across ethnicities as well as in other regions.

Conclusion
In conclusion, age, body temperature, dry cough, dizziness, CRP, and tree-in-bud sign were independent predictors of MPP in adults. We constructed a nomogram with a reliable predictive power. The nomogram may be a powerful tool to assist clinicians in making personalized decisions.

Data availability
The datasets used during the current study available from the corresponding author on reasonable request.